Probability distribution of drawdowns in risky investments. 
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We study the risk criterion for investments based on the 
drawdown from the maximal value of the capital in the past. 
Depending on investor's risk attitude, thus his risk exposure, 
we find that the distribution of these drawdowns follows a 
general power law. In particular, if the risk exposure is Kelly- 
optimal, the exponent of this power law has the borderline 
value of 2, i.e. the average drawdown is just about to diverge. 



For repeated investments one may consider maintaining a 
fixed fraction / of one's capital in risky assets, while keeping 
the rest of the capital in risk-free securities. After the pioneer- 
ing work by Kelly [[lj it is generally believed that the optimal 
strategy is to choose the investment fraction /* which max- 
imizes the average growth rate of the logarithm of the capi- 
tal However, many economists and prudent investors 
would balk at this aggressive proposal, since this strategy pro- 
poses a unique recipe for all purposes, without considering the 
investor's risk profile. Traditional alternative to the Kelly in- 
vestment recipe is to select the investment fraction, maximiz- 
ing the expectation value of some investor-specific utility func- 
tion Js) . Unfortunately, this recipe leads to incorrect expec- 
tations, since for very broad distributions, such as log-normal 
distributions in multiplicative stochastic processes, the expec- 
tation value is dominated by an exponentially small fraction 
of outcomes, and is unlikely to be achieved after a reasonably 
large number of trials. In the past this common-sense obser- 
vation has caused persistent debates and was often leading to 
fallacious conclusions 

If the overall shape of utility function is of little relevance in 
determining the optimal long-term investment strategy, what 
investment property one should consider to distinguish be- 
tween aggressive and conservative investment strategies? In 
this work we systematically study a risk criterion based on 
the probability distribution of drawdowns of the capital mea- 
sured relative to its highest value in the past, which we refer 
to as drawdowns from the maximum. It is often quoted in the 
trading community that the probability of a given drawdown 
from the maximum is one of the most sensible parameters 
of an investment strategy Often investors identify their 
wealth as the highest achieved amount. Hence, at any time 
the current drawdown from the highest capital in the past 
gives a measure of investor's frustration, tests his strength of 
nerves and his faith in the ultimate recovery. 

The definition of the drawdown from the maximum is 
rather natural. Let W(t) to denote investor's capital as a 
function of time. Define W max (t) to be the overall maxi- 
mum of the capital up to this point in time: W max (t) = 
max t /< t W (t 1 ). The current drawdown from the maximum 
(DDM) D(t) is given by 



D(t) = W max {t)/W{t). (1) 

From this definition it follows that D(t) > 1 with equality 
realized only if the current capital is at its overall maximal 
value. 

Let us first find out the DDM probability distribution in 
a very general case where the investor's capital follows a 
discrete-time multiplicative random walk 

W(t+1) =e l{t) W(t). (2) 

In this expression a random number rj(t) is drawn at each 
time step t from a given probability distribution n(rj). As 
usual, it is easier to work with the logarithm of the capital 
h(t) — \nW(t), which performs an ordinary random walk 

h(t + i) = h(t) + r){t). (3) 

The logarithmic drawdown from the maximum (LDDM) 
LD(t) = In D(t) is simply given by 

LD(t) = max hit') - hit). (4) 
t'<t 

To the purpose of calculating the probability distribution 
function of LDDM let us divide the time axis into a sequence 
of time intervals during which h, nax (t) = \nW max (t) stays 
constant (see Fig. 1). 
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FIG. 1. Random walk h(t) = lnW(t) (solid line) and its 
maximal value up to time t, h max (t) (bold solid line). Between 
t n and tn+ij the maximal value h max (t) stays constant and is 
equal to h(t n ). 
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Each such interval starts at time t n when the walk is at 
the overall maximum of h(t) and ends at time t n +i when 
this maximum is surpassed and replaced by a new, higher 
one. The motion in each of the intervals (t n ,t n +i) can be 
viewed as a random walk with an open upper boundary at 
h(t„): the process ends (and the new one starts) when the 
walk leaves the interval (—00, h max ). Within a single interval 
without loss of generality we can set h max = and, conse- 
quently, LD(t) = —h(t). In order to find the distribution 
of drawdowns from the maximum we need first to calculate 
the time dependent density p(h,t) of the ensemble of random 
walks in the interval (— 00, 0) with an open upper boundary. 
The process starts at t = 0, when p(h, 0) = 8(h). Note that, 
since walks can leave the system, the probability of finding 
a walk within the interval (— 00, 0) (i.e. of finding the cur- 
rent maximum unsurpassed t time steps after it was realized), 
Ptot(t) = p(h,i)dh < 1, is not conserved. Open bound- 
ary conditions are equivalent to maintaining p(h, t) = for 
ft > at all times. 

The distribution of drawdowns from the maximum 
P(x)dx — Prob(s < LD < x + dx) measures the probability 
of finding a given logarithmic drawdown at an arbitrary time 
without any reference to the time t elapsed since the last max- 
imum. Therefore, P(x) is proportional to the time-cumulative 
density at the point h ma x — x = —x: P(x) ~ p(—x,t). 
Including normalization one gets 



P(x) = 



£ t "oP(-M) 



(5) 



Any ensemble density p(x, t) should obey the following in- 
tegral equation of motion 



p(x,t) 



p(x — rj,t— l)n(rj)dri 



(6) 



This equation expresses the density p(x,t) at time t in terms 
of the known density at the previous time step t — 1 and the 
probability distribution of jumps. It fixes p(x, t) for x < 0, 
while the definition of an open boundary random walk re- 
quires p(x,t) = for x > 0. The stationary probability dis- 
tribution of drawdowns from the maximum makes sense only 
for random walks with a positive (upwards) drift (77} > 0. 
Indeed, for a negative (downwards) drift the maximum re- 
alized in the beginning of the process is likely to be never 
surpassed. In this case as the walk drifts further and fur- 
ther down it samples larger and larger drawdowns so that 
the probability distribution of drawdowns never becomes sta- 
tionary. Using the equation of motion (^|) for p(h, t) one 
gets P(x) = A- 1 J2Zo P(~x, *) = A~ l (ZZ 1 J p(-x - v ,t- 
l)n(rj)dri + p(-x,0)) = J P(x + r))-n:(rj)dr) + A~ 1 S(x). Here 
A = X/t^o Jo P(~ x '' t)dx' is the normalization factor. 
Therefore, for x > one has 



P(x) = / P(x + r))n(r))dr). 



(7) 



Note also that in order to follow our definition of logarithmic 
drawdowns we had to change the sign in front of 77, compared 
to that in Eq. |(| It is a straightforward task to determine 
the asymptotic behavior of P(x) for x much bigger than the 
typical value of 77. In this case we can safely disregard that 



P(x) = for x < and plug the ansatz functional form 
P(x) ~ exp(— Fx) into the Eq. (?]. The parameter F > is 
then determined from the equation 



7t(tj) exp(— Trj)dri — 1. 



(8) 



One can show that this equation has at most one strictly pos- 
itive solution. In fact the sufficient and necessary condition 
for the existence of such solution is a positive upwards drift 
(rf) > plus a nonzero support of n(r/) for negative 77 < 0. 
Indeed, first one notices that the second derivative of V(r), 
where V(F) is the LHS of Eq. ^ with respect to F, is strictly 
positive, and F = is the obvious solution to V(T) = 1. The 
nonzero 71(77) f° r some 77 < guarantees that V(+<x) = +00. 
Since dV/dT\r=o = — (rj) < 0, the continuity of V(T) guaran- 
tees the existence of the positive solution of V(r) = I. The 
positive second derivative ensures its uniqueness. 

In order to get a better feeling of how the parameters of the 
jump distribution 71(77) determine F we consider two particular 
functional forms of the distribution 71(77). We first see what 
happens if 71(77) nas a binomial shape. For simplicity let us 
take a particular binomial distribution, where 77 = In A with 
probability p > 1/2, and 77 = — In A with probability 1 — p. In 
other words, with probability p one's capital is multiplied by 
A > 1, otherwise it is divided by A. For this distribution Eq. 
|§] reduces to p/y + (1 — p) y = 1, where y — A r . This quadratic 
equation has two solutions 771 = 1 and 772 = p/(l — p). For 
p > 1/2 (upwards drift condition) the second solution gives 
the desired positive 



hip — ln(l — p) 

iiiA ' 



(9) 



The other case we use to illustrate Eq. |^ is when 7r(rj) has a 
Gaussian shape 71(77) — (l/\/27r<r) exp(— (77 — p) 2 /2a 2 ). Then 
Eq. ^ can be rewritten as exp(— F(p — T(T 2 /2)) = 1. The 
unique nontrivial (r 7^ 0) solution, given by 



2/< 



(10) 



is positive, provided /1 > 0, i.e. the random walk has an 
upwards drift. 

The last equation can be also derived within a continuous 
time approach. Indeed, increments of a continuous-time ran- 
dom walk, taken at a discrete time intervals, necessarily have 
a gaussian shape so that Eq. ^ should hold in this case. An- 
other way to see this is to replace the integral equation (f?|) 
with the differential stationary Fokker-Planck equation 



dP(x) 
dx 



+ ■ 



d 2 P(x) 
dx 2 



0. 



(11) 



where fi, and a 2 are the drift velocity and the dispersion 
of the random walk. This equation has a solution P(x) = 
A -1 exp(— rx), where F is given by 



- continuous — 



2/< 



(12) 



The exponential distribution of logarithmic drawdowns 
LD = In D corresponds to the power law distribution of draw- 
downs themselves. To properly change variables one notices 
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that Prob(LD > x) ~ exp(-rx). Therefore, Prob(£> > y) ~ 
y~ r , and for the distribution of D one has P(D) ~ l/Z) 1+r . 

It is interesting to note that the mechanism by which draw- 
downs from the maximum acquire a power law distribution 
is similar to that of the multiplicative random walk pushed 
against the wall. This mechanism, which was first analyzed in 
a financial context by Levy & Solomon || and later studied 
in greater detail in |],|l(|, is rather simple. It is well known 
that the problem of finding a stationary distribution of a mul- 
tiplicative random walk drifting in the direction of a reflecting 
wall can be rewritten in terms of a Fokker-Planck equation for 
the logarithm of the observable variable with reflecting bound- 
ary condition at the position of the wall. The solution of this 
equation has the well known exponential (Boltzmann) form 
which, being rewritten in terms of the variable, subject to the 
multiplicative noise, becomes a power law. In our analytical 
approach to the problem of drawdowns from the maximum, 
the current maximal value of the capital serves as an absorb- 
ing upper wall for a random walk (once the walk surpasses 
the current maximum, the value of the maximum has to be 
updated, which can be looked at as simply taking another 
representative of the ensemble). Of course, the Fokker-Plank 
equation with an absorbing boundary, unlike with a reflecting 
boundary, does not allow for a stationary solution. However, 
as was demonstrated above, the equations for cumulative (in- 
tegrated over time) distributions are identical in both cases. 
That is why it should not be surprising that Eq. [l^ of this 
paper is identical to the Eq. 10 of Ref. fllQ[ |, which deter- 
mines the exponent of the stationary power law distribution 
of multiplicative random walk pushed against the hard wall. 

Now we are in a position to derive the distribution of the 
drawdowns from the maximum for the investor, following a 
constant investment fraction strategy jl]-^j . In such a strategy 
the investor invests a fraction of his capital in one risky asset 
while keeping the remainder safely in risk- free securities. At 
each discrete time step the investor sells or buys the correct 
amount of shares of risky asset to adjust the current value 
of his asset holdings to precisely the fraction / of his total 
capital. This investment fraction / (leverage factor if / > 1) 
is the sole parameter defining the strategy. In this work we 
do not allow the change of the reinvestment time interval (the 
discrete time step at which the investor adjusts his asset hold- 
ings). Also for simplicity we set the risk- free interest rate to 
zero. The generalization to a more general situation is rather 
straightforward, but makes our final formulas less transpar- 
ent. The evolution of investor's capital for a fixed investment 
fraction strategy is given by a multiplicative random walk 

W(t+l) = W(t)(l~f + fe^). (13) 

In this expression the random variable 77(f) describes the mul- 
tiplicative fluctuations of the price p(t) of the risky asset: 
p(t + 1) = e v ^p(t). The results for the distribution of draw- 
downs from the maximum derived above apply to the fixed 
investment fraction strategy if one uses /-dependent random 
walk variable r]f = ln(l-/ + /e") so that e v t = l-/+/e" (t) . 
If / = 1, i.e. the whole capital is invested in risky asset, 
r/f — r\ and the whole capital just follows the multiplicative 
random walk of the risky asset's price. It is clear that by 
selecting a smaller investment fraction / the investor reduces 
the probability of significant drawdowns from the maximum, 
so that Tf is a decreasing function of /. 



The results are especially straightforward in the case when 
the logarithm of the stock price follows a continuous gaussian 
random walk with drift velocity fi and dispersion a 2 . As it was 
shown for instance in jij, for a gaussian Tv(rj) the logarithm of 
the capital subject to a fixed investment faction strategy has 
the drift velocity fif and dispersion a 2 given by 



W = A* 



2 ) 1 2 
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2 j-2 



(14) 
(15) 



The exponent of the power law distribution of drawdowns 
P(D), Tf — F f + 1 = 2fj,f/a'f + 1, in this case is given by 



gaussian 



2/4- 



(16) 



The bigger is this exponent, the safer is your investment from 
large drawdowns. Of course, the stationary distribution of 
drawdowns is limited to the case when fj,f > 0. For the 
exponent 77 this corresponds to the condition Tf > 1, i.e. 
normalizable P(D). 

It was suggested by Kelly that for the long term invest- 
ment the optimal fixed investment fraction strategy would be 
the one maximizing /j,f. For the Gaussian ir(rf) the invest- 
ment fraction /* in this Kelly-optimal strategy is given by 
/* = 1/2 + /i/<r 2 . Indeed, this is what one gets from maxi- 
mization of the drift velocity fif given by Eq. [li]. It is inter- 
esting to note that for the Kelly optimal strategy the DDM 
distribution exponent has a superuniversal value 



(17) 



This result is not restricted to gaussian n(rj). It is straight- 
forward to demonstrate that it holds at the Kelly opti- 
mum for any n(rj). Indeed, by definition of the Kelly op- 
timal strategy it maximizes the growth rate of the loga- 
rithm of the capital given by [if = (ln(l — / + fe v )). There- 
fore, = d^f/d_f\ r = ((e"-l)/(l-r + /'e')> = (1 - 
((1 — /* + f* eV ) From this equation it follows that 

at the Kelly optimum one has (e""/) = {e~ MW*+r<=")) = 
((1 - f* + /"e")- 1 ) = 1, i.e. T f . = 1 (r r = I> +1 = 2) 
is the solution to (H). That proves that for an arbitrary dis- 
tribution n(r)) precisely at the Kelly optimum /* the power 
law distribution of drawdowns has a superuniversal exponent 
r r = 2. 

Let us illustrate these results using an example of a risky 
asset, the price of which with equal probability p — 1/2 goes 
up by 30% or down by -24.4%. This is precisely the exam- 
ple of a hypothetical "red chip" stock that we used in [^] 
to illustrate the power of Kelly optimization. The stock it- 
self is doomed: its price is going down by roughly 1% every 
time step (typically at each time step the price is multiplied 
by VT 30 - 0.756 « 0.99). On the other hand, since average 
return of 2.8% of this stock is positive, following the Kelly- 
optimal fixed investment fraction strategy with /* ~ 0.3825 
results in a positive growth rate of investor's capital of some 
0.53%. We have simulated the outcomes of investment pro- 
cess with different investment fractions both above and be- 
low Kelly optimal. Fig. 2 displays the time dependence of 
investor's capital for / = 0.1, 0.38, 0.7, and 1. 
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FIG. 2. The evolution of the capital invested in the hypo- 
thetical risky asset described in the text at different invest- 
ment fractions / = 0.38, 0.7, 0.1, 1 (from top to bottom by the 
last point IU(1000) in the time series) . 

It is clear from this figure that the final capital after 1000 
time steps grows as / is increased from to 0.38 and starts to 
go down above 0.38 so that for / > 0.765 the typical growth 
rate becomes negative and the investor ends up loosing money. 
This is illustrated in Fig. 2 on the example of the / = 1 curve, 
where the investor trusted his whole capital to the stock and 
is going down together with this doomed stock. 

In Fig. 3 we plot the probability distributions of the draw- 
downs from the maximum for different investment fractions 
in this stock. 
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FIG. 3. The probability to have a drawdown bigger than 
D as a function of D for the same hypothetical risky asset 
as in Fig. 2. The power law exponent r/ is systematically 
decreasing with the investment fraction / ranging from 0.1 to 
0.7. The exponent of the P{D(t) > D) at the Kelly optimum 
/* = 0.3825 is in excellent agreement with the theoretical 
prediction r/» = 1. 5 x 10 8 data points were used to make 
histograms in this plot. 

The trend of increasing probability of large drawdowns as 



/ is increased can be clearly seen. The P(D) calculated at the 
Kelly optimal fraction /* ~ 0.3825 is in agreement with our 
theoretical prediction of T f* = 1. 

To illustrate our results on a more concrete example we an- 
alyzed the time dependence of the capital invested in S&P500 
index during the year of 1996, using half hourly data pro- 
vided by Olsen&Associates. In our hypothetical "investment" 
we selected and maintained on half-hour basis three different 
fixed leverage factors: / = 5, 10, and 15. Any / > 1, of 
course, can be realized only if such a leverage ratio is allowed 
(this is the case e.g. using derivatives such as futures). The 
resulting drawdown distributions are shown in Fig. 4. 




FIG. 4. The probability to have a drawdown bigger than D 
as a function of D for a leveraged investment in the S&P500 
index during the year of 1996. The investment (leverage) 
factors are / = 5, 10, 15 from left to right. 

The largest leverage factor / = 15 approximately corre- 
sponds to the Kelly optimum for this asset under the condi- 
tion of zero risk-free interest rate. 

In summary, we have proposed and studied a risk mea- 
sure for repeated investment games. We see that, unlike the 
traditional expected utility approach describing the risk in 
terms of a single number, we need a whole function to judge 
if the risk is worth undertaking. Under general conditions this 
function - the distribution of drawdowns from the maximum 
- has a power law shape. Kelly's optimal solution represents 
the most aggressive strategy, since the power law barely gives 
a finite expectation value of drawdowns (the exponent be- 
ing 2). More risk-adverse investors can resort to sub-optimal 
strategies in the Kelly sense, where large drawdowns are con- 
siderably tamed. However, even those "safer" strategies are 
not absolutely free from the risk: since power laws do not 
have built in cutoffs, ruins (large drawdowns) can in principle 
arrive but are much less likely. 
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